Calibrating Deep Neural Networks using Focal Loss

Neural Information Processing Systems

Miscalibration -- a mismatch between a model's confidence and its correctness -- of Deep Neural Networks (DNNs) makes their predictions hard to rely on. Ideally, we want networks to be accurate, calibrated and confident. We show that, as opposed to the standard cross-entropy loss, focal loss (Lin et al., 2017) allows us to learn models that are already very well calibrated. When combined with temperature scaling, whilst preserving accuracy, it yields state-of-the-art calibrated models. We provide a thorough analysis of the factors causing miscalibration, and use the insights we glean from this to justify the empirically excellent performance of focal loss. To facilitate the use of focal loss in practice, we also provide a principled approach to automatically select the hyperparameter involved in the loss function. We perform extensive experiments on a variety of computer vision and NLP datasets, and with a wide variety of network architectures, and show that our approach achieves state-of-the-art calibration without compromising on accuracy in almost all cases.
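For readers unfamiliar with the loss being discussed, here is a minimal NumPy sketch (not the authors' code) of the multi-class focal loss of Lin et al. (2017), FL = -(1 - p_t)^γ · log(p_t), which reduces to the standard cross-entropy when γ = 0:

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over the last axis."""
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def focal_loss(logits, targets, gamma=2.0):
    """Mean focal loss FL = -(1 - p_t)^gamma * log(p_t) (Lin et al., 2017).

    With gamma = 0 this is exactly the mean cross-entropy; larger gamma
    down-weights well-classified (high p_t) samples.
    """
    p = softmax(logits)
    pt = p[np.arange(len(targets)), targets]  # probability of the true class
    return np.mean(-((1.0 - pt) ** gamma) * np.log(pt))
```

Because the (1 - p_t)^γ factor shrinks the loss on confident correct predictions, training with γ > 0 discourages the over-confidence that the abstract identifies as a source of miscalibration.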


Review for NeurIPS paper: Calibrating Deep Neural Networks using Focal Loss

Neural Information Processing Systems

Weaknesses: - It is not clear from the article whether weight decay was used in the experiments, for both the cross-entropy and the focal loss. Weight decay has a non-negligible effect on weight norms. The curves in the plot in Fig. 2(e) would suggest the use of weight decay, but this is not mentioned in the text. Note that some learning rate schedulers remove weight decay at low learning rate values. Could the authors please clarify this aspect?


Review for NeurIPS paper: Calibrating Deep Neural Networks using Focal Loss

Neural Information Processing Systems

This paper was reviewed by 4 reviewers and there was unanimous agreement that the paper should be accepted (scores ranging from "marginally above threshold" to "clear accept"). I agree with the reviewers and recommend the paper be accepted. All 4 reviewers provided quite detailed suggestions for improvement, and I strongly recommend that the authors carefully take these suggestions into account in revising the paper for the camera-ready version. In particular, please be sure to take into account the reviewer suggestions (R1 and R2 specifically) on improving and clarifying the wording of your OOD claims. Please also be sure to include vanilla Focal Loss results in the main paper (R1, R2, R5).


Calibrating Deep Neural Network using Euclidean Distance

Liang, Wenhao, Dong, Chang, Zheng, Liangwei, Li, Zhengyang, Zhang, Wei, Chen, Weitong

arXiv.org Machine Learning

Uncertainty is a fundamental aspect of real-world scenarios, where perfect information is rarely available. Humans naturally develop complex internal models to navigate incomplete data and effectively respond to unforeseen or partially observed events. In machine learning, Focal Loss is commonly used to reduce misclassification rates by emphasizing hard-to-classify samples. However, it does not guarantee well-calibrated predicted probabilities and may result in models that are overconfident or underconfident. High calibration error indicates a misalignment between predicted probabilities and actual outcomes, affecting model reliability. This research introduces a novel loss function called Focal Calibration Loss (FCL), designed to improve probability calibration while retaining the advantages of Focal Loss in handling difficult samples. By minimizing the Euclidean norm through a strictly proper loss, FCL penalizes the instance-wise calibration error and constrains bounds. We provide theoretical validation for the proposed method and apply it to calibrate CheXNet for potential deployment in web-based health-care systems. Extensive evaluations on various models and datasets demonstrate that our method achieves SOTA performance in both calibration and accuracy metrics.
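To make the idea concrete, here is a hedged NumPy sketch of a focal loss augmented with a Euclidean (squared L2) calibration penalty between the predicted probability vector and the one-hot label. This is an illustration of the general approach the abstract describes, not the paper's exact FCL formulation; the weight `lam` is a hypothetical hyperparameter introduced here for the sketch.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over the last axis."""
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def focal_calibration_loss(logits, targets, gamma=2.0, lam=1.0):
    """Sketch: focal term plus mean squared Euclidean distance ||p - y||^2
    between predicted probabilities and one-hot labels (a Brier-style,
    strictly proper penalty). `lam` weights the calibration term and is an
    assumption of this sketch, not taken from the paper."""
    p = softmax(logits)
    n, k = p.shape
    onehot = np.eye(k)[targets]
    pt = p[np.arange(n), targets]
    focal = np.mean(-((1.0 - pt) ** gamma) * np.log(pt))
    calib = np.mean(np.sum((p - onehot) ** 2, axis=1))
    return focal + lam * calib
```

The squared-distance term is the multi-class Brier score, a strictly proper scoring rule, so it is minimized only when the predicted probabilities match the true outcome distribution; this is the mechanism by which a Euclidean penalty pushes per-instance calibration error down.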

